Proceedings of the International Conference on Image Processing , 1996 STRUCTURE - PRESERVING DOCUMENT IMAGE COMPRESSIONOmid
نویسندگان
چکیده
Maintaining a document in image form is often preferable in order to avoid the high cost of manual conversion or the introduction of large numbers of errors by automatic OCR and/or graphics interpretation. The large volume of data in the image can be greatly reduced by using compression techniques. Text-intensive document images typically have a great deal of redundancy in the bitmap representations of symbols, and we make use of that redundancy for compression by clustering components, representing each cluster by a template and encoding the error. Our method is novel in modeling the error associated with each cluster and in preserving structure, an important component for readability and processing. Document image compression for storage or transmission is an important research topic, in part because of the relatively large amounts of data that documents contain. Text-intensive document images typically have signiicant amounts of symbol-level redundancy as characters of the same size and font repeat in the text. Our research centers around exploiting the redundancy to provide a compressed representation which preserves document-level features necessary for later processing and analysis 5]. Many image compression schemes are widely available. For binary document images, CCITT Group 3 and 4 are among the industry standards 8]. These standards are lossless compression schemes which were proposed for fax and modem use. They use variations of run-length encoding for optimized binary image transmission. For grayscale and color images, the JPEG standard 6] is a widely accepted lossy compression scheme. This method works well with textured images but does not exploit the structural redundancies within a document. In general, methods of image (a) (b) Figure 1: a) Symbolic image, b) Error image coding based on image models (e.g., 2]) do not exploit the redundancies that exist in documents and may render a document unreadable. Since many document processing algorithms degrade at resolutions below 300 dpi, simply reducing the resolution to save space is not necessarily an option. An important consideration is the preservation of structure so that symbols remain recognizable. In this paper we use a structural approach to compress textual images. The general approach, suggested in part by Ascher and Nagy 1] and later enhanced by Witten 7], rst clusters text-like components and represents each cluster by a single template (presumably of the underlying symbol); the residual error is then encoded (Figure 1). Our contribution is the use of a probabilistic model of the errors. In …
منابع مشابه
Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
متن کاملProceedings of the International Conference on Pattern Recognition , volume C , pages 664 - 668 , 1996 Structural Compression for Document
In this paper we describe a structural compression technique to be used for document text image storage and retrieval. The primary objective is to provide an eecient representation, storage, transmission and display. A secondary objective is to provide an encoding which allows access to speciied regions within the image and facilitates traditional document processing operations without requirin...
متن کاملAutomatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique
The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...
متن کاملSelective CRLA based Layout Analysis and Text Region Extraction from Low Quality Document Images
This paper aims at detecting textual regions by separating graphical regions using Selective CRLA scheme and statistical textual properties on noise infected and low resolution newspaper images. A Bottom Up approach is adopted (i.e.) Selective Constrained Run Length algorithm (CRLA) is applied to obtain the layouts and region growing method over it, segments the homogeneous regions. Statistical...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996